Picture for Xiao Zhou

Xiao Zhou

UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception

Add code
May 29, 2026
Viaarxiv icon

DiffSpot: Can VLMs Spot Fine-Grained Visual Differences in Web Interfaces?

Add code
May 28, 2026
Viaarxiv icon

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

Add code
May 19, 2026
Viaarxiv icon

AcademiClaw: When Students Set Challenges for AI Agents

Add code
May 04, 2026
Viaarxiv icon

POINTS-Seeker: Towards Training a Multimodal Agentic Search Model from Scratch

Add code
Apr 15, 2026
Viaarxiv icon

Beyond Transcription: Unified Audio Schema for Perception-Aware AudioLLMs

Add code
Apr 14, 2026
Viaarxiv icon

POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs

Add code
Apr 13, 2026
Viaarxiv icon

Human Values Matter: Investigating How Misalignment Shapes Collective Behaviors in LLM Agent Communities

Add code
Apr 07, 2026
Viaarxiv icon

HypeMed: Enhancing Medication Recommendations with Hypergraph-Based Patient Relationships

Add code
Mar 19, 2026
Viaarxiv icon

Social-JEPA: Emergent Geometric Isomorphism

Add code
Feb 28, 2026
Viaarxiv icon